SEQUENCE LISTING 

<110> ADNEY, WILLIAM S 
DING, SHI-YOU S 
VINZANT, TODD B. 
DECKER, STEPHEN R. 
HIMMEL, MICHAEL E. 

<120> THERMAL TOLERANT EXOGLUCANASE FROM ACIDOTHERMUS CELLULOLYTICUS 
<130> NREL 01-38 

<140> US 09/917,384 
<141> 2001-07-28 

<160> 11 

<170> Patentln version 3.2 

<210> 1 

<211> 1121 

<212> PRT 

<213> Acidothermus cellulolyticus 

<220> 

<221> mi sc_f eature 

<223> Full-length sequence of Guxl protein 

<400> 1 

Met Pro Gly Leu Arg Arg Arg Leu Arg Ala Gly lie Val Ser Ala Ala 
1 5 10 15 

Ala Leu Gly ser Leu Val Ser Gly Leu val Ala Val Ala Pro val Ala 
20 25 30 

His Ala Ala val Thr Leu Lys Ala Gin Tyr Lys Asn Asn Asp Ser Ala 
35 40 45 

Pro Ser Asp Asn Gin lie Lys Pro Gly Leu Gin Leu Val Asn Thr Gly 
50 55 60 

Ser ser Ser val Asp Leu Ser Thr Val Thr Val Arg Tyr Trp Phe Thr 
65 70 75 80 

Arg Asp Gly Gly Ser Ser Thr Leu Val Tyr Asn cys Asp Trp Ala Ala 
85 90 95 

Met Gly cys Gly Asn lie Arg Ala Ser Phe Gly ser Val Asn Pro Ala 
100 105 110 

Thr Pro Thr Ala Asp Thr Tyr Leu Gin Leu Ser Phe Thr Gly Gly Thr 
115 120 125 



Leu Ala Ala Gly Gly Ser Thr Gly Glu lie Gin Asn Arg val Asn 
130 135 140 



Ser Asp Trp ser Asn Phe Asp Glu Thr Asn Asp Tyr ser Tyr Gly Thr 
14 5 150 155 160 

Asn Thr Thr Phe Gin Asp Trp Thr Lys Val Thr val Tyr val Asn Gly 
165 170 175 

Val Leu val Trp Gly Thr Glu Pro Ser Gly Ala Thr Ala Ser Pro Ser 
180 185 190 

Ala Ser Ala Thr Pro Ser Pro Ser Ser Ser Pro Thr Thr Ser Pro Ser 
195 200 205 

Ser ser Pro ser Pro Ser ser ser Pro Thr Pro Thr Pro Ser ser ser 
210 215 220 

ser pro Pro Pro ser ser Asn Asp Pro Tyr lie Gin Arg Phe Leu Thr 
22 5 230 235 240 

Met Tyr Asn Lys lie His Asp Pro Ala Asn Gly Tyr Phe ser Pro Gin 
245 250 255 

Gly He Pro Tyr His ser val Glu Thr Leu lie val Glu Ala Pro Asp 
260 265 270 

Tyr Gly His Glu Thr Thr Ser Glu Ala Tyr ser Phe Trp Leu Trp Leu 
275 280 285 

Glu Ala Thr Tyr Gly Ala Val Thr Gly Asn Trp Thr Pro Phe Asn Asn 
290 295 300 

Ala Trp Thr Thr Met Glu Thr Tyr Met He Pro Gin His Ala Asp Gin 
305 310 315 320 

Pro Asn Asn Ala Ser Tyr Asn Pro Asn Ser Pro Ala Ser Tyr Ala Pro 
325 330 335 

Glu Glu Pro Leu Pro Ser Met Tyr Pro val Ala lie Asp ser Ser val 
340 345 350 

Pro val Gly His Asp Pro Leu Ala Ala Glu Leu Gin Ser Thr Tyr Glv 
355 360 365 

Thr Pro Asp lie Tyr Gly Met His Trp Leu Ala Asp val Asp Asn lie 
370 375 380 



Tyr Gly Tyr Gly Asp ser Pro Gly Gly Gly Cys Glu Leu Gly Pro Ser 
385 390 395 400 



Ala Lys Gly val ser Tyr lie Asn Thr Phe Gin Arg Gly Ser Gin Glu 
405 410 415 

Ser val Trp Glu Thr val Thr Gin Pro Thr Cys Asp Asn Gly Lys Tyr 
420 425 430 

Gly Gly Ala His Gly Tyr val Asp Leu Phe He Gin Gly Ser Thr Pro 
435 440 445 

Pro Gin Trp Lys Tyr Thr Asp Ala Pro Asp Ala Asp Ala Arg Ala Val 
450 455 460 

Gin Ala Ala Tyr Trp Ala Tyr Thr Trp Ala Ser Ala Gin Gly Lys Ala 
465 470 475 480 

Ser Ala lie Ala Pro Thr He Ala Lys Ala Ser Gin Thr Gly Asp Tvr 
485 490 495 

Leu Arg Tyr Ser Leu Phe Asp Lys Tyr Phe Lys Gin Val Gly Asn cvs 
500 505 510 

Tyr Pro ,Ala ser Ser Cys Pro Gly Ala Thr Gly Arg Gin Ser Glu Thr 
515 520 525 



Tyr Leu lie Gly Trp Tyr Tyr Ala Trp Gly Gly ser Ser Gin Gly Trp 



535 



540 



Ala Trp Arg He Gly Asp Gly Ala Ala His Phe Gly Tyr Gin Asn Pro 
54 5 550 555 560 

Leu Ala Ala Trp Ala Met Ser Asn val Thr Pro Leu He Pro Leu ser 
565 570 575 

Pro Thr Ala Lys Ser Asp Trp Ala Ala ser Leu Gin Arg Gin Leu Glu 
580 585 590 

Phe Tyr Gin Trp Leu Gin Ser Ala Glu Gly Ala He Ala Gly Gly Ala 
595 600 605 

Thr Asn ser Trp Asn Gly Asn Tyr Gly Thr Pro Pro Ala Gly Asp Ser 
610 615 620 



Thr Phe Tyr Gly Met Ala Tyr Asp Trp Glu Pro val Tyr His Asp pro 



630 



635 



640 



Pro ser Asn Asn Trp phe Gly Phe Gin Ala Trp ser Met Glu Arq val 
645 650 655 



Ala Glu Tyr Tyr Tyr val Thr Gly Asp Pro Lys Ala Lys Ala Leu Leu 
660 665 670 

Asp Lys Trp val Ala Trp Val Lys Pro Asn val Thr Thr Gly Ala Ser 
675 680 685 

Trp ser He Pro ser Asn Leu Ser Trp ser Gly Gin Pro Asp Thr Trp 
690 695 700 

Asn Pro ser Asn Pro Gly Thr Asn Ala Asn Leu His Val Thr lie Thr 
705 710 715 720 

Ser ser Gly Gin Asp val Gly val Ala Ala Ala Leu Ala Lys Thr Leu 
725 730 735 

Glu Tyr Tyr Ala Ala Lys Ser Gly Asp Thr Ala Ser Arg Asp Leu Ala 
740 745 750 

Lys Gly Leu Leu Asp ser Met Trp Asn Asn Asp Gin Asp Ser Leu Gly 
755 760 765 

Val ser Thr Pro Glu Thr Arg Thr Asp Tyr Ser Arg phe Thr Gin val 
770 775 780 

Tyr Asp Pro Thr Thr Gly Asp Gly Leu Tyr lie Pro Ser Gly Trp Thr 
785 790 795 800 

Gly Thr Met Pro Asn Gly Asp Gin He Lys Pro Gly Ala Thr Phe Leu 
805 810 815 

ser He Arg ser Trp Tyr Thr Lys Asp Pro Gin Trp ser Lys Val Gin 
820 825 830 

Ala Tyr Leu Asn Gly Gly Pro Ala Pro Thr Phe Asn Tyr His Arq Phe 
835 840 845 

Trp Ala Glu Ser Asp Phe Ala Met Ala Asn Ala Asp Phe Gly Met Leu 
850 855 860 

Phe Pro ser Gly Ser pro Ser Pro Thr Pro ser Pro Thr Pro Thr Ser 
865 870 875 880 

ser pro ser Pro Thr pro Ser Ser Ser Pro Thr Pro Ser Pro ser Pro 
885 890 895 

ser Pro Thr Gly Asp Thr Thr Pro Pro ser val Pro Thr Gly Leu Gin 
900 905 910 



val Thr Gly Thr Thr Thr Ser Ser val ser Leu Ser Trp Thr Ala Ser 



.915 920 925 

Thr Asp Asn val Gly val Ala His Tyr Asn Val Tyr Arg Asn Gly Thr 
930 935 940 

Leu val Gly Gin Pro Thr Ala Thr Ser Phe Thr Asp Thr Gly Leu Ala 
945 950 955 960 

Ala Gly Thr ser Tyr Thr Tyr Thr val Ala Ala Val Asp Ala Ala Gly 
965 970 975 

Asn Thr ser Ala Gin ser Phe Ala Gly Asp ser Asp Asp Gly He Ala 
980 985 990 

Val Ala Ser Pro ser pro Ser Pro Thr Pro Thr ser Ser Pro Ser Pro 
995 1000 1005 

Thr Ser Pro Tnr Pro Ser Pro Tnr s er Thr Ser Gly Ala Ser 
1010 1015 1020 

cys Thr Ala Thr Tyr val val Asn ser Asp Trp Gly ser Gly Phe 
1025 1030 1035 

Thr T!l r « Thr Val Tnr Val Thr Asn Thr Gly Thr Arg Ala Thr Ser 
1040 1045 1050 

Gl y Jn? r Thr Val Thr Trp ser Phe Ala Gly Asn Gin Thr val Thr 
1055 1060 1065 

Asn Tv I„ Tr P Asn Thr Ala Leu Tnr Gin Ser Gly Lys ser val Thr 
1070 1075 1080 

Ala Asn Leu Ser Tvr Asn Asn Val He Gin Pro Gly Gin ser 
1085 1090 1095 

Thr Tt™ Phe Gly phe Asn Gl V Ser T y r Ser Gly Thr Asn Thr Ala 
1100 1105 1110 

Pro Thr Leu ser Cys Thr Ala Ser 
1115 1120 

<210> 2 

<211> 3365 

<212> DNA 

<213> Acidothermus cellulolyticus 
<220> 

<221> misc_feature 

<223> Guxl full-length coding sequence 



<400> 2 . 

atgccaggat tacgacggcg actccgcgcc ggtatcgtct cggcggcggc gttggggtcg 60 

ctggttagcg ggctcgttgc cgtcgcacca gtcgcgcacg cggcggtgac tctcaaagcg 120 

cagtataaga acaatgattc ggcgccgagt gacaaccaga tcaaaccggg tctccagttg 180 

gtgaataccg ggtcgtcgtc ggtggatttg tcgacggtga cggtgcggta ctggttcacc 240 

cgggatggtg ggtcgtcgac actggtgtac aactgtgact gggcggcgat ggggtgtggg 300 

aatatccgcg cctcgttcgg ctcggtgaac ccggcgacgc cgacggcgga cacctacctg 360 

cagttgtcgt tcactggtgg aacgttggcc gctggtgggt cgacgggtga gattcaaaac 420 

cgggtgaata agagtgactg gtcgaacttt gatgagacca atgactactc gtatgggacg 480 

aacaccacct tccaggactg gacgaaggtg acggtgtacg tcaacggcgt gttggtctgg 540 

gggaccgaac cgtccggagc gacggcgtct ccatccgcgt cggcgacgcc cagcccgtcc 600 

agttcaccga ccacgagtcc gagttcgtcc ccgtcgccga gcagcagccc gacgccgaca 660 

ccgagcagct cgtcgccgcc ccgtcgtcca acgacccgta catccagcgg ttcctcacga 720 

tgtacaacaa gattcacgac ccagcgaacg gctacttcag cccgcaggga attccctacc 780 

actcggtaga aacgctcatc gttgaggcac cggactacgg gcacgagaca acttcggagg 840 

cgtacagctt ctggctctgg ctcgaagcga cgtacggcgc agtgaccggc aactggacgc 900 

cgttcaacaa cgcctggacg acgatggaaa cgtacatgat cccgcagcac gcggaccagc 960 

cgaacaacgc gtcgtacaac cccaacagcc cggcgtcgta cgctccggaa gagccgctgc 1020 

ccagcatgta cccggttgcc atcgacagca gcgtgccggt tgggcacgac ccgctcgccg 1080 

ccgaattgca gtcgacgtac ggcactccgg acatttacgg catgcactgg ctggccgacg 1140 

ttgacaacat ctacggatac ggcgacagcc ccggcggtgg ttgcgaactc ggtccttccg 1200 

ctaagggcgt ctcctacatc aacacattcc agcgcggctc gcaggagtcc gtctgggaga 1260 

cggtcaccca gccgacgtgc gacaacggca agtacggtgg ggcgcacggc tacgtcgacc 1320 

tgttcatcca gggttcgacg ccgccgcagt ggaagtacac cgatgccccg gacgccgacg 1380 

cccgtgccgt ccaggctgcg tactgggcct acacctgggc atcggcgcag ggcaaggcaa 1440 

gcgcgattgc cccgacgatc gccaaggcga gccaaaccgg cgactacctg cggtactcgc 1500 

tctttgacaa gtacttcaag caggtcggca actgctaccc ggccagctcc tgccctggag 1560 

caaccggacg ccagagcgag acctacctga tcggctggta ctacgcctgg ggcggctcaa 1620 

gccaaggctg ggcctggcgc attggtgacg gcgccgcgca cttcggctac cagaatccgc 1680 

ttgccgcgtg ggcgatgtcg aacgtgacac cgctcattcc gctctcgccc acggcaaaga 1740 

gcgactgggc ggcgagcttg cagcgccagc tggagttcta ccagtggttg caatccgcgg 1800 

aaggagccat tgcgggcggc gccaccaaca gctggaacgg caattacggg accccgccgg 1860 

ccggagactc gaccttctac ggcatggcgt acgactggga gccggtctac cacgacccgc 1920 



i 



cgagcaacaa ctggttcggc ttccaggcgt ggtccatgga acgggttgcc gagtactact 1980 

acgtcaccgg cgacccgaag gccaaggcgc tgctcgacaa gtgggtcgca tgggtgaagc 2040 

cgaatgtcac caccggtgcc tcatggtcga ttccgtcgaa tttgtcctgg agcggccaac 2100 

cggatacctg gaatccgagc aacccaggaa cgaatgccaa cctgcacgtg accatcacgt 2160 

cgtccgggca ggacgtcggt gttgccgcgg cgctcgcgaa gacactcgag tactacgcgg 2220 

caaaatccgg cgatacggcc tcgcgcgacc tcgcgaaggg attgctcgac tccatgtgga 2280 

acaacgacca ggacagcctc ggtgtgagca caccggagac gcggaccgac tactctcggt 2340 

tcactcaggt gtacgacccg acgactggtg acggcctcta catcccgtcg ggttggacgg 2400 

ggaccatgcc caacggtgac caaatcaagc cgggtgcgac cttcctgagc atccggtcct 2460 

ggtacaccaa ggatccgcag tggtcgaagg tgcaggcgta cctcaacggc gggcctgctc 2520 

cgacgttcaa ctaccaccgg ttctgggcgg agtccgactt cgcgatggcg aacgccgatt 2580 

ttggcatgct cttcccatcc gggtcgccca gcccgacccc gagcccgact ccgacgtcgt 2640 

ccccgagccc gactccgagc agctcgccga cgccgtcgcc cagcccgtca ccgaccggcg 2700 

acaccacgcc gccgagcgtg ccgacgggtc ttcaggtcac cgggacaacg acgtcgtccg 2760 

tgtcgctcag ctggaccgcg tccaccgaca acgtcggcgt cgcgcactac aacgtgtacc 2820 

gaaacggcac gctggtgggt cagccgacag cgacgtcgtt cacggacacc ggcctggctg 2880 

ctggcacgtc gtacacgtac acagtggcgg ccgttgatgc ggccggtaac acgtcggcgc 2940 

agagcttcgc cggtgacagc gacgacggca tcgccgtcgc gagcccgtcg ccgagcccga 3000 

ctccgacgtc gtccccgagc ccaacgccgt cgccgacacc gtcaccgacg tccaccagcg 3060 

gcgcatcgtg cactgctacc tacgttgtca atagcgactg gggtagcggc ttcacgacaa 3120 

ccgtgaccgt gacgaacacc ggcaccaggg ccaccagtgg ctggacggtc acgtggagct 3180 

ttgccggtaa tcagacggtc accaactact ggaacaccgc gctgacgcaa tccggaaagt 3240 

cggtgaccgc aaagaacctg agttacaaca acgtcatcca acctggtcag tcgacgacct 3300 

ttggattcaa cggaagttac tcaggaacaa acaccgcgcc gacgctcagc tgcacggcaa 3360 

gctga 3365 

<210> 3 
<211> 34 
<212> PRT 

<213> Acidothermus cellulolyticus 

<220> 

<2 2 1> mi sc_f eatu re 

<223> Potential signal peptide of Guxl 
<400> 3 

Met Pro Gly Leu Arg Arg Arg Leu Arg Ala Gly lie val ser Ala Ala 
15 10 15 



Ala Leu Gly ser Leu Val Ser Gly Leu Val Ala Val Ala Pro Val Ala 
20 25 30 

His Ala 



<210> 4 

<211> 153 

<212> PRT 

<213> Acidothermus cellulolyticus 
<220> 

<221> misc_feature 

<223> CBD III of Guxl 

<400> 4 

Ala val Thr Leu Lys Ala Gin Tyr Lys Asn Asn Asp Ser Ala Pro Ser 
15 10 15 

Asp Asn Gin lie Lys Pro Gly Leu Gin Leu Val Asn Thr Gly ser Ser 
20 25 30 

Ser val Asp Leu Ser Thr val Thr val Arg Tyr Trp Phe Thr Arg Asp 
35 40 45 

Gly Gly ser ser Thr Leu Val Tyr Asn Cys Asp Trp Ala Ala Met Gly 
50 55 60 

cys Gly Asn lie Arg Ala Ser Phe Gly Ser Val Asn Pro Ala Thr Pro 
65 70 75 80 

Thr Ala Asp Thr Tyr Leu Gin Leu Ser Phe Thr Gly Gly Thr Leu Ala 
85 90 95 

Ala Gly Gly Ser Thr Gly Glu lie Gin Asn Arg val Asn Lys Ser Asp 
100 105 110 

Trp ser Asn Phe Asp Glu Thr Asn Asp Tyr Ser Tyr Gly Thr Asn Thr 
115 120 125 

Thr Phe Gin Asp Trp Thr Lys val Thr val Tyr Val Asn Gly Val Leu 
130 135 140 

val Trp Gly Thr Glu Pro Ser Gly Ala 
145 150 

<210> 5 
<211> 640 
<212> PRT 



<213> Acidothermus cellulolyticus 

<220> 

<221> mi s cofeature 

<223> GH48 domain of Guxl 

<400> 5 

Asn Asp Pro Tyr lie Gin Arg Phe Leu Thr Met Tyr Asn Lys lie His 
1 5 10 15 

Asp Pro Ala Asn Gly Tyr Phe Ser Pro Gin Gly lie Pro Tyr His Ser 
20 25 30 

val Glu Thr Leu lie Val Glu Ala Pro Asp Tyr Gly His Glu Thr Thr 
35 40 45 

Ser Glu Ala Tyr Ser Phe Trp Leu Trp Leu Glu Ala Thr Tyr Gly Ala 
50 55 60 

val Thr Gly Asn Trp Thr Pro Phe Asn Asn Ala Trp Thr Thr Met Glu 
65 70 75 80 

Thr Tyr Met lie Pro Gin His Ala Asp Gin Pro Asn Asn Ala Ser Tyr 
85 90 95 

Asn Pro Asn Ser Pro Ala Ser Tyr Ala Pro Glu Glu Pro Leu Pro Ser 
100 105 110 

Met Tyr Pro Val Ala lie Asp Ser Ser Val Pro val Gly His Asp Pro 
115 120 125 

Leu Ala Ala Glu Leu Gin Ser Thr Tyr Gly Thr Pro Asp lie Tyr Gly 
130 135 140 

Met His Trp Leu Ala Asp Val Asp Asn lie Tyr Gly Tyr Gly Asp Ser 
145 150 155 160 

Pro Gly Gly Gly Cys Glu Leu Gly Pro Ser Ala Lys Gly Val Ser Tyr 
165 170 175 

lie Asn Thr Phe Gin Arg Gly Ser Gin Glu Ser val Trp Glu Thr Val 
180 185 190 

Thr Gin Pro Thr Cys Asp Asn Gly Lys Tyr Gly Gly Ala His Gly Tyr 
195 200 205 

val Asp Leu Phe lie Gin Gly Ser Thr Pro Pro Gin Trp Lys Tyr Thr 
210 215 220 



Asp Ala Pro Asp Ala Asp Ala Arg Ala val Gin Ala Ala Tyr Trp Ala 
225 230 235 240 

Tyr Thr Trp Ala Ser Ala Gin Gly Lys Ala Ser Ala He Ala Pro Thr 
245 250 255 

lie Ala Lys Ala ser Gin Thr Gly Asp Tyr Leu Arg Tyr Ser Leu Phe 
260 265 270 

Asp Lys Tyr Phe Lys Gin val Gly Asn Cys Tyr Pro Ala Ser Ser Cys 
275 280 285 

Pro Gly Ala Thr Gly Arg Gin Ser Glu Thr Tyr Leu He Gly Trp Tyr 
290 295 300 

Tyr Ala Trp Gly Gly Ser ser Gin Gly Trp Ala Trp Arg He Gly Asp 
305 310 315 320 

Gly Ala Ala His Phe Gly Tyr Gin Asn Pro Leu Ala Ala Trp Ala Met 
325 330 335 

Ser Asn val Thr Pro Leu lie Pro Leu Ser Pro Thr Ala Lys Ser asd 
340 345 350 

Trp Ala Ala Ser Leu Gin Arg Gin Leu Glu Phe Tyr Gin Trp Leu Gin 
355 360 365 

Ser Ala Glu Gly Ala lie Ala Gly Gly Ala Thr Asn Ser Trp Asn Gly 
370 375 380 

Asn Tyr Gly Thr Pro Pro Ala Gly Asp ser Thr Phe Tyr Gly Met Ala 
385 390 395 400 

Tyr Asp Trp Glu Pro val Tyr His Asp Pro Pro Ser Asn Asn Trp Phe 
405 410 415 

Gly Phe Gin Ala Trp ser Met Glu Arg val Ala Glu Tyr Tyr Tvr Val 
420 425 430 

Thr Gly Asp Pro Lys Ala Lys Ala Leu Leu Asp Lys Trp Val Ala Trp 
435 440 445 

val Lys Pro Asn val Thr Thr Gly Ala ser Trp ser He Pro Ser Asn 
450 455 460 

Leu ser Trp ser Gly Gin Pro Asp Thr Trp Asn Pro Ser Asn Pro Gly 
465 470 475 480 



Thr Asn Ala Asn Leu His val Thr lie Thr ser Ser Gly Gin Asp Val 



485 490 495 

Gly val Ala Ala Ala Leu Ala Lys Thr Leu Glu Tyr Tyr Ala Ala Lys 
500 505 510 

Ser Gly Asp Thr Ala Ser Arg Asp Leu Ala Lys Gly Leu Leu Asp Ser 
515 520 525 

Met Trp Asn Asn Asp Gin Asp Ser Leu Gly Val Ser Thr Pro Glu Thr 
530 535 540 

Arg Thr Asp Tyr ser Arg Phe Thr Gin Val Tyr Asp Pro Thr Thr Gly 
545 550 555 560 

Asp Gly Leu Tyr lie Pro Ser Gly Trp Thr Gly Thr Met Pro Asn Gly 
565 570 575 

Asp Gin lie Lys Pro Gly Ala Thr Phe Leu Ser lie Arg ser Trp Tyr 
580 585 590 

Thr Lys Asp Pro Gin Trp Ser Lys Val Gin Ala Tyr Leu Asn Gly Gly 
595 600 605 

Pro Ala Pro Thr Phe Asn Tyr His Arg Phe Trp Ala Glu Ser Asp Phe 
610 615 620 

Ala Met Ala Asn Ala Asp Phe Gly Met Leu Phe Pro Ser Gly Ser Pro 
625 630 635 640 

<210> 6 

<211> 85 

<212> PRT 

<213> Acidothermus cellulolyticus 
<220> 

<221> misc_feature 

<223> FN ill domain of Guxl 

<400> 6 

Asp Thr Thr Pro Pro Ser val Pro Thr Gly Leu Gin val Thr Gly Thr 
1 5 10 15 

Thr Thr ser Ser val Ser Leu Ser Trp Thr Ala Ser Thr Asp Asn Val 
20 25 30 

Gly val Ala His Tyr Asn val Tyr Arg Asn Gly Thr Leu Val Gly Gin 
35 40 45 

Pro Thr Ala Thr Ser Phe Thr Asp Thr Gly Leu Ala Ala Gly Thr Ser 
50 55 60 



\ 



Tyr Thr Tyr Thr Val Ala Ala val Asp Ala Ala Gly Asn Thr Ser Ala 
65 70 75 80 

Gin Ser Phe Ala Gly 
85 



<210> 7 

<211> 101 

<212> PRT 

<213> Acidothermus cellulolyticus 

<220> 

<221> misc_feature 

<223> CBD II domain of Guxl 

<400> 7 

Gly Ala Ser Cys Thr Ala Thr Tyr Val val Asn Ser Asp Trp Gly Ser 
15 10 15 



Gly Phe Thr Thr Thr val Thr Val Thr Asn Thr Gly Thr Arg Ala Thr 
20 25 30 



Ser Gly Trp Thr val Thr Trp Ser Phe Ala Gly Asn Gin Thr Val Thr 
35 40 45 

Asn Tyr Trp Asn Thr Ala Leu Thr Gin Ser Gly Lys Ser val Thr Ala 
50 55 60 

Lys Asn Leu Ser Tyr Asn Asn Val lie Gin Pro Gly Gin Ser Thr Thr 
65 70 75 80 

Phe Gly Phe Asn Gly Ser Tyr Ser Gly Thr Asn Thr Ala Pro Thr Leu 
85 90 95 

Ser cys Thr Ala Ser 
100 



<210> 8 

<211> 6 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic histidine tag 



<220> 

<221> misc_feature 
<223> Synthetic His Tag 



<400> 



8 



His His His His His His 
1 5 

<210> 9 
<211> 638 
<212> PRT 

<213> Acidothermus cellulolyticus 

<220> 

<221> mi sc_f eature 

<223> GH48 domain of Acdothermus cellulolyticus 

<400> 9 

Pro Tyr lie Gin Arg Phe Leu Thr Met Tyr Asn Lys lie His Asp Pro 
15 10 15 

Ala Asn Gly Tyr Phe Ser Pro Gin Gly lie Pro Tyr His Ser Val Glu 
20 25 30 

Thr Leu lie val Glu Ala Pro Asp Tyr Gly His Glu Thr Thr Ser Glu 
35 40 45 

Ala Tyr Ser Phe Trp Leu Trp Leu Glu Ala Thr Tyr Gly Ala Val Thr 
50 55 60 

Gly Asn Trp Thr Pro Phe Asn Asn Ala Trp Thr Thr Met Glu Thr Tyr 
65 70 75 80 

Met lie Pro Gin His Ala Asp Gin Pro Asn Asn Ala Ser Tyr Asn Pro 
85 90 95 

Asn Ser Pro Ala Ser Tyr Ala Pro Glu Glu Pro Leu Pro Ser Met Tyr 
100 105 110 

Pro Val Ala lie Asp Ser Ser Val Pro Val Gly His Asp Pro Leu Ala 
115 120 125 



Ala Glu Leu Gin Ser Thr Tyr Gly Thr Pro Asp lie Tyr Gly Met His 
130 135 140 

Trp Leu Ala Asp Val Asp Asn lie Tyr Gly Tyr Gly Asp ser Pro Gly 
145 150 155 160 

Gly Gly Cys Glu Leu Gly Pro Ser Ala Lys Gly Val Ser Tyr lie Asn 
165 170 175 

Thr Phe Gin Arg Gly Ser Gin Glu Ser val Trp Glu Thr val Thr Gin 
180 185 190 

pro Thr cys Asp Asn Gly Lys Tyr Gly Gly Ala His Gly Tyr val Asp 



195 



200 



205 



Leu Phe lie Gin Gly Ser Thr Pro Pro Gin Trp Lys Tyr Thr Asp Ala 
210 215 220 

Pro Asp Ala Asp Ala Arg Ala Val Gin Ala Ala Tyr Trp Ala Tyr Thr 
225 230 235 240 

Trp Ala ser Ala Gin Gly Lys Ala Ser Ala He Ala Pro Thr lie Ala 
245 250 255 

Lys Ala Ser Gin Thr Gly Asp Tyr Leu Arg Tyr ser Leu Phe Asp Lys 
260 265 270 

Tyr Phe Lys Gin val Gly Asn Cys Tyr Pro Ala Ser Ser Cys Pro Gly 
275 280 285 

Ala Thr Gly Arg Gin Ser Glu Thr Tyr Leu lie Gly Trp Tyr Tyr Ala 
290 295 300 

Trp Gly Gly ser Ser Gin Gly Trp Ala Trp Arg lie Gly Asp Gly Ala 
305 310 315 320 

Ala His Phe Gly Tyr Gin Asn Pro Leu Ala Ala Trp Ala Met Ser Asn 
325 330 335 

val Thr Pro Leu lie Pro Leu Ser Pro Thr Ala Lys Ser Asp Trp Ala 
340 345 350 

Ala Ser Leu Gin Arg Gin Leu Glu Phe Tyr Gin Trp Leu Gin Ser Ala 
355 360 365 

Glu Gly Ala lie Ala Gly Gly Ala Thr Asn Ser Trp Asn Gly Asn Tyr 
370 375 380 

Gly Thr Pro Pro Ala Gly Asp ser Thr Phe Tyr Gly Met Ala Tyr asd 
385 390 395 400 

Trp Glu Pro val Tyr His Asp Pro Pro ser Asn Asn Trp Phe Gly Phe 
405 410 415 

Gin Ala Trp ser Met Glu Arg Val Ala Glu Tyr Tyr Tyr val Thr Gly 
420 425 430 

Asp Pro Lys Ala Lys Ala Leu Leu Asp Lys Trp val Ala Trp Val Lys 
435 440 445 



Pro Asn val Thr Thr Gly Ala Ser Trp Ser lie Pro ser Asn Leu ser 
450 455 460 



Trp ser Gly Gin Pro Asp Thr Trp Asn Pro Ser Asn Pro Gly Thr Asn 
465 470 475 480 

Ala Asn Leu His val Thr lie Thr Ser ser Gly Gin Asp val Gly val 
485 490 495 

Ala Ala Ala Leu Ala Lys Thr Leu Glu Tyr Tyr Ala Ala Lys Ser Gly 
500 505 510 

Asp Thr Ala Ser Arg Asp Leu Ala Lys Gly Leu Leu Asp Ser Met Trp 
515 520 525 

Asn Asn Asp Gin Asp Ser Leu Gly Val Ser Thr Pro Glu Thr Arg Thr 
530 535 540 

Asp Tyr Ser Arg Phe Thr Gin val Tyr Asp Pro Thr Thr Gly Asp Gly 
545 550 555 560 

Leu Tyr lie Pro Ser Gly Trp Thr Gly Thr Met Pro Asn Gly Asp Gin 
565 570 575 

lie Lys Pro Gly Ala Thr Phe Leu Ser lie Arg ser Trp Tyr Thr Lys 
580 585 590 

Asp Pro Gin Trp ser Lys Val Gin Ala Tyr Leu Asn Gly Gly Pro Ala 
595 600 605 

Pro Thr Phe Asn Tyr His Arg Phe Trp Ala Glu Ser Asp Phe Ala Met 
610 615 620 

Ala Asn Ala Asp Phe Gly Met Leu Phe Pro Ser Gly Ser Pro 
625 630 635 

<210> 10 

<211> 640 

<212> PRT 

<213> Cellulomonas fimi 
<220> 

<221> misc_feature 

<223> Cellulomonas fimi CBHB 

<400> 10 

Glu Tyr Ala Gin Arg Phe Leu Ala Gin Tyr Asp Lys lie Lys Asp Pro 
15 10 15 

Ala Asn Gly Tyr Phe Ser Ala Gin Gly He Pro Tyr His Ala val Glu 
20 25 30 



Thr Leu Met val Glu Ala Pro Asp Tyr Gly His Glu Thr Thr Ser Glu 
35 40 45 

Ala Tyr Ser Tyr Trp Leu Trp Leu Glu Ala Leu Tyr Gly Gin Val Thr 
50 55 60 

Gin Asp Trp Ala Pro Leu Asn His Ala Trp Asp Thr Met Glu Lys Tyr 
65 70 75 80 

Met He Pro Gin Ser val Asp Gin Pro Thr Asn Ser Phe Tyr Asn Pro 
85 90 95 

Asn Ser Pro Ala Thr Tyr Ala Pro Glu Phe Asn His Pro Ser Ser Tyr 
100 105 110 

Pro Ser Gin Leu Asn ser Gly lie Ser Gly Gly Thr Asp Pro lie Gly 
115 120 125 

Ala Glu Leu Lys Ala Thr Tyr Gly Asn Ala Asp val Tyr Gin Met His 
130 135 140 

Trp Leu Ala Asp val Asp Asn lie Tyr Gly Phe Gly Ala Thr Pro Gly 
145 150 155 160 

Ala Gly Cys Thr Leu Gly Pro Thr Ala Thr Gly Thr ser Phe lie Asn 
165 170 175 

Thr Phe Gin Arg Gly Pro Gin Glu Ser Val Trp Glu Thr val Pro Gin 
180 185 190 

Pro Ser cys Glu Glu Phe Lys Tyr Gly Gly Lys Asn Gly Tyr Leu Asp 
195 200 205 

Leu Phe Thr Lys Asp Ala Ser Tyr Ala Lys Gin Trp Lys Tyr Thr Ser 
210 215 220 

Ala ser Asp Ala Asp Ala Arg Ala val Glu Ala Val Tyr Trp Ala Asn 
225 230 235 240 

Gin Trp Ala Thr Glu Gin Gly Lys Ala Ala Asp Val Ala Ala Thr Val 
245 250 255 

Ala Lys Ala Ala Lys Met Gly Asp Tyr Leu Arg Tyr Thr Leu Phe Asp 
260 265 270 



Lys Tyr Phe Lys Lys lie Gly Cys Thr ser Pro Thr Cys Ala Ala Gly 
275 280 285 



Gin Gly Arg Glu Ala Ala His Tyr Leu Leu Ser Trp Tyr Met Ala Tro 
290 295 300 

Gly Gly Ala Thr Asp Thr ser Ser Gly Trp Ala Trp Arg He Gly Ser 
305 310 315 320 

Ser His Ala His Phe Gly Tyr Gin Asn Pro Leu Ala Ala Trp Ala Leu 
325 330 335 

Ser Thr Asp Pro Lys Leu Thr Pro Lys ser Pro Thr Ala Lys Ala Asd 
340 345 350 

Trp Ala Ala ser Met Gin Arg Gin Leu Glu Phe Tyr Thr Trp Leu Gin 
355 360 365 

Ala ser Asn Gly Gly lie Ala Gly Gly Ala Thr Asn Ser Trp Asp Gly 
370 375 380 



385 Tyi " Gln Pr ° ™° Ala Gly Thr Pr ° Phe Tyr Gly Met G - v 



390 



395 



400 



Tyr Thr Glu Ala Pro Val Tyr val Asp Pro Pro Ser Asn Arg Trp Phe 
405 410 415 

Gly Met Gin Ala Trp Gly val Gin Arg val Ala Glu Leu Tyr Tyr Ala 
420 425 430 

Ser Gly Asn Ala Gin Ala Lys Lys He Leu Asp Lys Trp val Pro Trp 
435 440 445 

Val Y a l Ala Asn Ile Ser Thr AS P G ly Al a Ser Trp Lys Val Pro Ser 
450 455 460 

Glu Leu Lys Trp Thr Gly Lys Pro Asp Thr Trp Asn Ala Ala Ala Pro 
465 470 475 480 

Thr Gly Asn Pro Gly Leu Thr Val Glu Val Thr Ser Tyr Gly Gin Asd 
485 490 495 

Val Gly val Ala Ala Asp Thr Ala Arg Ala Leu Leu Phe Tyr Ala Ala 
500 505 510 

Lys ser Gly Asp Thr Ala ser Arg Asp Lys Ala Lys Ala Leu Leu Asp 
515 520 525 

Ala ll% Trp Ala Asn Asn Gln AS P Pro Leu Gl y val Ser Ala Val Glu 
530 535 540 



Thr Arg Gly Asp Tyr Lys Arg Phe Asp Asp Thr Tyr val Ala Asn Gly 



545 550 555 560 

Asp Gly lie Tyr lie Pro Ser Gly Trp Thr Gly Thr Met Pro Asn Glv 
565 570 575 

Asp val He Lys Pro Gly val Ser Phe Leu Asp He Arg Ser Phe Tyr 
580 585 590 

Lys Lys Asp Pro Asn Trp ser Lys Val Gin Thr Phe Leu Asp Gly Gly 
595 600 605 

Ala Glu Pro Gin Phe Arg Tyr His Arg Phe Trp Ala Gin Thr Ala Val 
610 615 620 

Ala Gly Ala Leu Ala Asp Tyr Ala Arg Leu Phe Asp Asp Gly Thr Thr 
625 630 635 640 

<210> 11 

<211> 642 

<212> PRT 

<213> Thermobifida fusca 
<220> 

<221> misc_feature 

<223> Thermobifida fusca E6 

<400> 11 

Ser Tyr Asp Gin Ala Phe Leu Glu Gin Tyr Glu Lys lie Lys Asp Pro 
1 5 10 15 

Ala ser Gly Tyr Phe Arg Glu Phe Asn Gly Leu Leu Val Pro Tyr His 
20 25 30 

ser val Glu Thr Met He val Glu Ala Pro Asp His Gly His Gin Thr 
35 40 45 

Thr ser Glu Ala Phe Ser Tyr Tyr Leu Trp Leu Glu Ala Tyr Tyr Gly 
50 55 60 

Arg val Thr Gly Asp Trp Lys Pro Leu His Asp Ala Trp Glu ser Met 
65 70 75 80 

Glu Thr Phe lie lie Pro Gly Thr Lys Asp Gin Pro Thr Asn ser Ala 
85 90 95 

Tyr Asn Pro Asn Ser Pro Ala Thr Tyr lie Pro Glu Gin Pro Asn Ala 
100 105 110 

Asp Gly Tyr Pro ser Pro Leu Met Asn Asn val Pro val Gly Gin Asp 
115 120 125 



Pro Leu Ala Gin Glu Leu ser Ser Thr Tyr Gly Thr Asn Glu lie Tyr 
130 135 140 

Gly Met His Trp Leu Leu Asp Val Asp Asn Val Tyr Gly Phe Gly Phe 
145 150 155 160 

Cys Gly Asp Gly Thr Asp Asp Ala Pro Ala Tyr lie Asn Thr Tyr Gin 
165 170 175 

Arg Gly Ala Arg Glu Ser Val Trp Glu Thr lie Pro His Pro Ser Cys 
180 185 190 

Asp Asp Phe Thr His Gly Gly Pro Asn Gly Tyr Leu Asp Leu Phe Thr 
195 200 205 

Asp Asp Gin Asn Tyr Ala Lys Gin Trp Arg Tyr Thr Asn Ala Pro Asp 
210 215 220 

Ala Asp Ala Arg Ala val Gin Val Met Phe Trp Ala His Glu Trp Ala 
225 230 235 240 

Lys Glu Gin Gly Lys Glu Asn Glu lie Ala Gly Leu Met Asp Lys Ala 
245 250 255 

Ser Lys Met Gly Asp Tyr Leu Arg Tyr Ala Met Phe Asp Lys Tyr Phe 
260 265 270 

Lys Lys lie Gly Asn Cys val Gly Ala Thr ser cys Pro Gly Gly Gin 
275 280 285 

Gly Lys Asp Ser Ala His Tyr Leu Leu , Ser Trp Tyr Tyr Ser Trp Gly 
290 295 300 

Gly ser Leu Asp Thr ser Ser Ala Trp Ala Trp Arg lie Gly Ser Ser 
305 310 315 320 

Ser ser His Gin Gly Tyr Gin Asn val Leu Ala Ala Tyr Ala Leu Ser 
325 330 335 

Gin val Pro Glu Leu Gin Pro Asp Ser Pro Thr Gly val Gin Asp Trp 
340 345 350 

Ala Thr Ser Phe Asp Arg Gin Leu Glu Phe Leu Gin Trp Leu Gin Ser 
355 360 365 



Ala Glu Gly Gly lie Ala Gly Gly Ala Thr Asn Ser Trp Lys Gly Ser 
370 375 380 



Tyr Asp Thr Pro Pro Thr Gly Leu Ser Gin Phe Tyr Gly Met Tyr Tyr 
385 390 395 400 

Asp Trp Gin Pro Val Trp Asn Asp Pro Pro Ser Asn Asn Trp Phe Gly 
405 410 415 

Phe Gin Val Trp Asn Met Glu Arg val Ala Gin Leu Tyr Tyr val Thr 
420 425 430 

Gly Asp Ala Arg Ala Glu Ala lie Leu Asp Lys Trp Val Pro Trp Ala 
435 440 445 

lie Gin His Thr Asp val Asp Ala Asp Asn Gly Gly Gin Asn Phe Gin 
450 455 460 

val Pro Ser Asp Leu Glu Trp Ser Gly Gin Pro Asp Thr Trp Thr Gly 
465 470 475 480 

Thr Tyr Thr Gly Asn Pro Asn Leu His Val Gin val Val Ser Tyr Ser 
485 490 495 

Gin Asp val Gly val Thr Ala Ala Leu Ala Lys Thr Leu Met Tyr Tyr 
500 505 510 

Ala Lys Arg Ser Gly Asp Thr Thr Ala Leu Ala Thr Ala Glu Gly Leu 
515 520 525 

Leu Asp Ala Leu Leu Ala His Arg Asp Ser lie Gly lie Ala Thr Pro 
530 535 540 

Glu Gin Pro Ser Trp Asp Arg Leu Asp Asp Pro Trp Asp Gly Ser Glu 
545 550 555 560 

Gly Leu Tyr Val Pro Pro Gly Trp Ser Gly Thr Met Pro Asn Gly Asp 
565 570 575 

Arg lie Glu Pro Gly Ala Thr Phe Leu Ser lie Arg Ser Phe Tyr Lys 
580 585 590 

Asn Asp Pro Leu Trp Pro Gin Val Glu Ala His Leu Asn Asp Pro Gin 
595 600 605 

Asn val Pro Ala Pro lie Val Glu Arg His Arg Phe Trp Ala Gin val 
610 615 620 



Glu lie Ala Thr Ala Phe Ala Ala His Asp Glu Leu Phe Gly Ala Gly 
625 630 635 640 



Ala Pro 



